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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
CAGCGTGCGC CATCCTTCCC 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

□ (C) STRANDEDNESS: single 

y!3 (D) TOPOLOGY: linear 

HJ (ii) MOLECULE TYPE: DNA (genomic) 
4;; (iv) ANTI-SENSE: NO 

!S (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 
? CTTTTCCTCT GGGAAGGATG GCGCACGCTG GGAGA 

[|;| 2) INFORMATION FOR SEQ ID NO : 3 : 

O (i) SEQUENCE CHARACTERISTICS: 

O (A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
GATGCACCTA CCCAGCCTCC 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 
ACGGGGTACG GAGGCTGGGT AGGTGCATCT GGT 



(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

! (C) STRANDEDNESS: single 

I (D) TOPOLOGY: linear 

! (ii) MOLECULE TYPE: DNA (genomic) 
= (iv) ANTI-SENSE: YES 

{ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 
ACAAAGGCAT CCTGCAGTTG 



^2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 
;;1 (A) LENGTH: 3 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 
CCCCCAACTG CAGGATGCCT TTGTGGAACT GTACGG 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7 
GGGAAGGATG GCGCACGCTG 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
i (D) TOPOLOGY: linear 

i (ii) MOLECULE TYPE: DNA (genomic) 
: * (iv) ANTI-SENSE: YES 

y ( X i) SEQUENCE DESCRIPTION: SEQ ID NO: 8 
-CGCGTGCGAC CCTCTTG 



fi|2) INFORMATION FOR SEQ ID NO : 9 : 

fif (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 
TACCGCGTGC GACCCTC 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 
TCCTACCGCG TGCGACC 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

I (C) STRANDEDNESS: single 

i (D) TOPOLOGY: linear 

J (ii) MOLECULE TYPE: DNA (genomic) 



(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 
JXCTTCCTACC GCGTGCG 



l ! :[2) INFORMATION FOR SEQ ID NO: 12: 

O (i) SEQUENCE CHARACTERISTICS: 

□ (A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l 



GACCCTTCCT ACCGCGT 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 
GGAGACCCTT CCTACCG 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
[ (D) TOPOLOGY: linear 

\ (ii) MOLECULE TYPE: DNA (genomic) 
I (iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 
a GCGGCGGCAG CGCGG 



Y\[2) INFORMATION FOR SEQ ID NO: 15: 

Sf (i) SEQUENCE CHARACTERISTICS: 

2 (A) LENGTH: 15 base pairs 

— (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l 
CGGCGGGGCG ACGGA 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 
CGGGAGCGCG GCGGGC 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
i (D) TOPOLOGY: linear 

'i (xi) SEQUENCE DESCRIPTION: SEQ ID NO:l' 
4|CTCCCAGCG TGCGCCAT 

f: i E 

U|2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 
U (A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 
I;;! (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

O (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 
TGCACTCACG CTCGGCCT 



(2) INFORMATION FOR SEQ ID NO : 19 : 

(i) SEQUENCE CHARACTERISTICS:^ 

(A) LENGTH: 5086 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 



GCGCCCGCCC 


CTCCGCGCCG 


CCTGCCCGCC 


CGCCCGCCGC 


GCTCCCGCCC < 


GCCGCTCTCC 


O U 


GTGGCCCCGC 


CGCGCTGCCG 


CCGCCGCCGC 


TGCCAGCGAA 


GGTGCCGGGG 


CTCCGGGCCC 




TCCCTGCCGG 


CGGCCGTCAG 


CGCTCGGAGC 


GAACTGCGCG 


ACGGGAGGTC 


CGGGAGGv-GA 




CCGTAGTCGC 


GCCGCCGCGC 


AGGACCAGGA 


GGAGGAGAAA 


GGGTGCGCAG 


CCCGGAGGCG 




GGGTGCGCCG 


GTGGGGTGCA 


GCGGAAGAGG 


GGGTCCAGGG 


GGGAGAACTT 


CGTAGCAG I C 


TOO 
J u u 


ATCCTTTTTA 


GGAAAAGAGG 


GAAAAAATAA 


AACCCTCCCC 


CACCACCTCC 


TTCTCCCCAC 


i ^ n 


CCCTCGCCGC 


ACCACACACA 


GCGCGGGCTT 


CTAGCGCTCG 


GCACCGGCGG 


GCCAGGCGCG 


a o n 


TCCTGCCTTC 


ATTTATCCAG 


CAGCTTTTCG 


GAAAATGCAT 


TTGCTGTTCG 


GAo ill An 1 V- 


A 8 0 


AGAAGACGAT 


TCCTGCCTCC 


GTCCCCGGCT 


CCTTCATCGT 


CCCATCTCCC 




SAD 


CCTGGGGAGG 


CGTGAAGCGG 


TCCCGTGGAT 


AGAGATTCAT 

# 


GCCTGTGTCC 






rGCGCGCGTAT 


AAATTGCCGA 


GAAGGGGAAA 


ACATCACAGG 


ACTTCTGCGA 




660 

\J \J V/ 


JSaaaattgta 


ATTCATCTGC 


CGCCGCCGCT 


GCCAAAAAAA 


AACTCGAGCT 


nmrnr atti rp/*»rn 


7 0 0 


: GCGGTTGGGA 


TTCCTGCGGA 


TTGACATTTC 


TGTGAAGCAG 


AAGTCTGGGA 


A rp i" 1 Arnrrppr' 
Al CLrAl C 1 


7 8 0 


pftAATCCTCCT 


AATTTTTACT 


CCCTCTCCCC 


CCGACTCCTG 


ATTCATTGGG 


AACj 111 U AAA 


84 0 
u t \j 


MpCAGCTATAA 


CTGGAGAGTG 


CTGAAGATTG 


ATGGGATCGT 


TGCCTTATGC 

o 


A 1 llLxllllVj 


9 00 


HlTTTTACAAA 


AAGGAAACTT 


GACAGAGGAT 


CATGCTGTAC 


_ ■ m Tk Tfc TV TV TV m T\ 

TTAAAAAATA 


OA A n~i A A T 1 
C AACj 1 AAo 1 


9 60 


flCGCACAGGA 


AATTGGTTTA 


ATGTAACTTT 


CAATGGAAAC 


CTTTGAGATT 


mrnrprp a rrprn a a 
Ti 1 1AL1 1AA 


102 0 

X U L< W 


fAGTGCATTCG 


AGTAAATTTA 


* 

ATTTCCAGGC 


AGCTTAATAC 


ATTGTTTTTA 


GCCCjILjI Int 


108 0 

_L W W w 


TTGTAGTGTG 


TATGCCCTGC 


TTTCACTCAG 


TGTGTACAGG 


GAAACGCACC 


m a nirpn~irnn~irn A 

TGA1 1 1 1 1 1 A 


114 0 

X X T v 


CTTATTAGTT 


TGTTTTTTCT 


TTAACCTTTC 


AGCATCACAG 


AGGAAGTAGA 


/—y rn a rp a rp rn a A 

CTGA 1 A 1 1 AA 


19 00 


CAATACTTAC 


TAATAATAAC 


GTGCCTCATG 


AAATAAAGAT 


CCGAAAGGAA 


rnm/^p A a rn a a a 
11 Kj \j AA 1 AAA 


12 60 

X Ct \J v 


AATTTCCTGC 


GTCTCATGCC 


AAGAGGGAAA 


CACCAGAATC 


AAGTGTTCCG 


/■^ y*"»m /*■* A rprp/^ A A 

CGTGA1 1GAA 


1^20 


GACACCCCCT 


0 

CGTCCAAGAA 


TGCAAAGCAC 


ATCCAATAAA 


ATAGCTGGAT 


m Ik m a A orn rp 

TATAAC 




CTTCTTTCTC 


TGGGGGCCGT 


GGGGTGGGAG 


CTGGGGCGAG 


AGGTGCCGTT 


GGCCCCCG 1 1 


14 4 0 


GCTTTTCCTC 


TGGGAAGGAT 


GGCGCACGCT 


GGGAGAACGG 


GGTACGACAA 


CCGGGAGATA 


i c n n 


GTGATGAAGT 


ACATCCATTA 


TAAGCTGTCG 


CAGAGGGGCT 


ACGAGTGGGA 


TGCGGGAGAT 


1560 


GTGGGCGCCG 


CGCCCCCGGG 


GGCCGCCCCC 


: GCACCGGGCA 


TCTTCTCCTC 


: CCAGCCCGGG 


1620 


CACACGCCCC 


! ATCCAGCCGC 


I ATCCCGCGAC 


: CCGGTCGCCA GGACCTCGCC 


: GCTGCAGACC 


1680 
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# 




CCGGCGCCGC 


CGCGGGGCCT 


GCGCTCAGCC 


CGGTGCCACC 


TGTGGTCCAC 


1/40 


GCCAAGCCGG 


CGACGACTTC 


TCCCGCCGCT 


ACCGCGGCGA 


CTTCGCCGAG 


18 00 


AGCTGCACCT 


GACGCCCTTC 


ACCGCGCGGG 


GACGCTTTGC 


CACGGTGGTG 


1 o c r\ 
lb O 0 


TCAGGGACGG 


GGTGAACTGG 


GGGAGGATTG 


TGGCCTTCTT 


TGAGTTCGGT 


i q i n 

iy <i u 


GTGTGGAGAG 


CGTCAACCGG 


GAGATGTCGC 


CCCTGGTGGA 


CAACATCGCC 


i q q n 


CTGAGTACCT 


GAACCGGCAC 


CTGCACACCT 


GGATCCAGGA 


TAACGGAGGC 


*> n a n 


TTGTGGAACT 


GTACGGCCCC 


AGCATGCGGC 


CTCTGTTTGA 


TTTCTCCTGG 


z iu u 


AGACTCTGCT 


CAGTTTGGCC 


CTGGTGGGAG 


CTTGCATCAC 


CCTGGGTGCC 




ACAAGTGAAG 


TCAACATGCC 


TGCCCCAAAC 


AAATATGCAA 


AAGGTTCACT 




AAATAATATG 


CATTGTCAGT 


GATGTACCAT 


GAAACAAAGC 


TGCAGGCTGT 


z z o u 


ATAACACACA 


TATAAACATC 


ACACACACAG 


ACAGACACAC 


ACACACACAA 




TCTTCAGGCA 


AAACGTCGAA 


TCAGCTATTT 


ACTGCCAAAG 


GGAAATATCA 




ACATTATTAA 


GAAAAAAGAT 


TTATTTATTT 


AAGACAGTCC 


CATCAAAACT 


*■> /i ^ n 


AAATCCGACC 


ACTAATTGCC 


AAACACCGCT 


TCGTGTGGCT 


CCACCTGGAT 


n c o O 
Z D Z L> 


TGTAAACATA 


GATTCGCTTT 


CCATGTTGTT 


GGCCGGATCA 


CCATCTGAAG 


nco A 
ZDOU 


TGGAAAAAGG 


ACCTGATCAT 


TGGGGAAGCT 


GGCTTTCTGG 


CTGCTGGAGG 


o a n 

Z D *x U 


GGTGTTCATT 


CACTTGCATT 


TCTTTGCCCT 


GGGGGCGTGA 


TATTAACAGA 


*""T 

2700 


CCGTGGGGGG 


AAGTCCATGC 


CTCCCTGGCC 


TGAAGAAGAG 


ACTCTTTGCA 


2760 


ATGATGCATA 


CCTGGTGGGA 


GGAAAAGAGT 


TGGGAACTTC 


AGATGGACCT 


2820 



AGTACCCACT 


GAGATTTCCA 


CGCCGAAGGA 


CAGCGATGGG 


AAAAATGCCC 


TTAAATCATA 


2880 


GGAAAGTATT 


TTTTTAAGCT 


ACCAATTGTG 


CCGAGAAAAG 


CATTTTAGCA 


ATTTATACAA 


2940 


TATCATCCAG 


TACCTTAAAC 


CCTGATTGTG 


TATATTCATA 


TATTTTGGAT 


ACGCACCCCC 


3000 


CAACTCCCAA 


TACTGGCTCT 


GTCTGAGTAA 


GAAACAGAAT 


CCTCTGGAAC 


TTGAGGAAGT 


3060 


GAACATTTCG 


GTGACTTCCG 


ATCAGGAAGG 


CTAGAGTTAC 


CCAGAGCATC 


AGGCCGCCAC 


3120 


AAGTGCCTGC 


TTTTAGGAGA 


CCGAAGTCCG 


CAGAACCTAC 


CTGTGTCCCA 


GCTTGGAGGC 


3180 


CTGGTCCTGG 


AACTGAGCCG 


GGCCCTCACT 


GGCCTCCTCC 


AGGGATGATC 


AACAGGGTAG 


3240 


TGTGGTCTCC 


GAATGTCTGG 


AAGCTGATGG 


ATGGAGCTCA 


GAATTCCACT 


GTCAAGAAAG 


3300 


AGCAGTAGAG 


GGGTGTGGCT 


GGGCCTGTCA 


CCCTGGGGCC 


CTCCAGGTAG 


GCCCGTTTTC 


3360 


ACGTGG AG C A 


TAGGAGCCAC 


GACCCTTCTT 


AAGACATGTA 


TCACTGTAGA 


GGGAAGGAAC 


3420 
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3900 
3960 



AGAGGCCCTG GGCCTTCCTA TCAGAAGGAC ATGGTGAAGG CTGGGAACGT GAGGAGAGGC 3480 
AATGGCCACG GCCCATTTTG GCTGTAGCAC ATGGCACGTT GGCTGTGTGG CCTTGGCCAC 3540 
CTGTGAGTTT AAAGCAAGGC TTTAAATGAC TTTGGAGAGG GTCACAAATC CTAAAAGAAG 3 600 

CATTGAAGTG AGGTGTCATG GATTAATTGA CCCCTGTCTA TGGAATTACA TGTAAAACAT 3 660 

TATCTTGTCA CTGTAGTTTG GTTTTATTTG AAAACCTGAC AAAAAAAAAG TTCCAGGTGT 3720 
GGAATATGGG GGTTATCTGT ACATCCTGGG GCATTAAAAA AAAATCAATG GTGGGGAACT 3780 
ATAAAGAAGT AACAAAAGAA GTGACATCTT CAGCAAATAA ACTAGGAAAT TTTTTTTTCT 3840 
TCCAGTTTAG AATCAGCCTT GAAACATTGA TGGAATAACT CTGTGGCATT ATTGCATTAT 
ATACCATTTA TCTGTATTAA CTTTGGAATG TACTCTGTTC AATGTTTAAT GCTGTGGTTG 
ATATTTCGAA AGCTGCTTTA AAAAAATACA TGCATCTCAG CGTTTTTTTG TTTTTAATTG 4 02 0 

TATTTAGTTA TGGCCTATAC ACTATTTGTG AGCAAAGGTG ATCGTTTTCT GTTTGAGATT 
fJTTATCTCTT GATTCTTCAA AAGCATTCTG AGAAGGTGAG ATAAGCCCTG AGTCTCAGCT 
SjcCTAAGAAA AACCTGGATG TCACTGGCCA CTGAGGAGCT TTGTTTCAAC CAAGTCATGT 
ICATTTCCAC GTCAACAGAA TTGTTTATTG TGACAGTTAT ATCTGTTGTC CCTTTGACCT 
Igtttcttga AGGTTTCCTC GTCCCTGGGC AATTCCGCAT TTAATTCATG GTATTCAGGA 

Jttacatgcat GTTTGGTTAA ACCCATGAGA ttcattcagt taaaaatcca gatggcgaat 

UcCAGCAGA TTCAAATCTA TGGTGGTTTG ACCTTTAGAG AGTTGCTTTA CGTGGCCTGT 
SrCAACACAG ACCCACCCAG AGCCCTCCTG CCCTCCTTCC GCGGGGGCTT TCTCATGGCT 
irCCTTCAGG GTCTTCCTGA AATGCAGTGG TCGTTACGCT CCACCAAGAA AGCAGGAAAC 
CTGTGGTATG AAGCCAGACC TCCCCGGCGG GCCTCAGGGA ACAGAATGAT CAGACCTTTG 
AATGATTCTA ATTTTTAAGC AAAATATTAT TTTATGAAAG GTTTACATTG TCAAAGTGAT 
GAATATGGAA TATCCAATCC TGTGCTGCTA TCCTGCCAAA ATCATTTTAA TGGAGTCAGT 
TTGCAGTATG CTCCACGTGG TAAGATCCTC CAAGCTGCTT TAGAAGTAAC AATGAAGAAC 
GTGGACGTTT TTAATATAAA GCCTGTTTTG TCTTTTGTTG TTGTTCAAAC GGGATTCACA 
GAGTATTTGA AAAATGTATA TATATTAAGA GGTCACGGGG G CT AATTGCT AGCTGGCTGC 
CTTTTGCTGT GGGGTTTTGT TACCTGGTTT TAATAACAGT AAATGTGCCC AGCCTCTTGG 
CCCCAGAACT GTACAGTATT GTGGCTGCAC TTGCTCTAAG AGTAGTTGAT GTTGCATTTT 
CCTTATTGTT AAAAACATGT TAGAAGCAAT GAATGTATAT AAAAGC 



4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5086 
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(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/ KEY : CDS 

(B) LOCATION: 1..717 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

ATG GCG CAC GCT GGG AGA ACG GGG TAC GAC AAC CGG GAG ATA GTG ATG 48 
Met Ala His Ala Gly Arg Thr Gly Tyr Asp Asn Arg Glu He Val Met 
1 5 10 15 

AAG TAC ATC CAT TAT AAG CTG TCG CAG AGG GGC TAC GAG TGG GAT GCG 9 6 

Lvs Tyr He His Tyr Lys Leu Ser Gin Arg Gly Tyr Glu Trp Asp Ala 

O 20 25 30 

SgA GAT GTG GGC GCC GCG CCC CCG GGG GCC GCC CCC GCA CCG GGC ATC 14 4 

4ly Asp Val Gly Ala Ala Pro Pro Gly Ala Ala Pro Ala Pro Gly He 
iy * 35 40 45 

ifTC TCC TCC CAG CCC GGG CAC ACG CCC CAT CCA GCC GCA TCC CGC GAC 19 2 

f phe Ser Ser Gin Pro Gly His Thr Pro His Pro Ala Ala Ser Arg Asp 

V\ 50 55 60 

i CC G GTC GCC AGG ACC TCG CCG CTG CAG ACC CCG GCT GCC CCC GGC GCC 24 0 

=Pro Val Ala Arg Thr Ser Pro Leu Gin Thr Pro Ala Ala Pro Gly Ala 

[1165 70 75 80 

ScC GCG GGG CCT GCG CTC AGC CCG GTG CCA CCT GTG GTC CAC CTG GCC 288 

«la Ala Gly Pro Ala Leu Ser Pro Val Pro Pro Val Val His Leu Ala 

85 90 95 

CTC CGC CAA GCC GGC GAC GAC TTC TCC CGC CGC TAC CGC GGC GAC TTC 336 

Leu Arg Gin Ala Gly Asp Asp Phe Ser Arg Arg Tyr Arg Gly Asp Phe 

100 105 HO 

GCC GAG ATG TCC AGC CAG CTG CAC CTG ACG CCC TTC ACC GCG CGG GGA 

Ala Glu Met Ser Ser Gin Leu His Leu Thr Pro Phe Thr Ala Arg Gly 

115 120 125 

CGC TTT GCC ACG GTG GTG GAG GAG CTC TTC AGG GAC GGG GTG AAC TGG 

Ara Phe Ala Thr Val Val Glu Glu Leu Phe Arg Asp Gly Val Asn Trp 
130 135 140 

GGG AGG ATT GTG GCC TTC TTT GAG TTC GGT GGG GTC ATG TGT GTG GAG 

Glv Arq He Val Ala Phe Phe Glu Phe Gly Gly Val Met Cys Val Glu 

145 150 155 160 



384 



432 



480 
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AGC GTC AAC 
Ser Val Asn 



ATG ACT GAG 

Met Thr Glu 



GGA GGC TGG 

Gly Gly Trp 

195 

CTG TTT GAT 
Leu Phe Asp 
210 

CTG GTG GGA 

Leu Val Gly 

225 



CGG GAG ATG 
Arg Glu Met 
165 

TAC CTG AAC 

Tyr Leu Asn 
180 

GAT GCC TTT 
Asp Ala Phe 



TTC TCC TGG 
Phe Ser Trp 



GCT TGC ATC 
Ala Cys lie 
230 



TCG CCC CTG 
Ser Pro Leu 



CGG CAC CTG 

Arg His Leu 
185 

GTG GAA CTG 
Val Glu Leu 
200 

CTG TCT CTG 

Leu Ser Leu 
215 

ACC CTG GGT 

Thr Leu Gly 



GTG GAC AAC 
Val Asp Asn 
170 

CAC ACC TGG 
His Thr Trp 



TAC GGC CCC 

Tyr Gly Pro 



AAG ACT CTG 
Lys Thr Leu 
220 

GCC TAT CTG 

Ala Tyr Leu 
235 



ATC GCC CTG 
lie Ala Leu 
175 

ATC CAG GAT 
lie Gin Asp 
190 

AGC ATG CGG 

Ser Met Arg 

205 

CTC AGT TTG 
Leu Ser Leu 



AGC CAC AAG 

Ser His Lys 



TGG 528 
Trp 



AAC 57 6 

Asn 



CCT 624 
Pro 



GCC 672 
Ala 



717 



(2) INFORMATION FOR SEQ ID NO: 21: 

% (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 39 amino acids 

(B) TYPE: amino acid 
H (D) TOPOLOGY: linear 

f; (ii) MOLECULE TYPE: protein 

Ul (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

ifet Ala His Ala Gly Arg Thr Gly Tyr Asp Asn Arg Glu He Val Met 

M= 1 5 1° Lt> 

s Tyr He His Tyr Lys Leu Ser Gin Arg Gly Tyr Glu Trp Asp Ala 
n 20 25 30 

Sly Asp Val Gly Ala Ala Pro Pro Gly Ala Ala Pro Ala Pro Gly He 

35 40 45 

Phe Ser Ser Gin Pro Gly His Thr Pro His Pro Ala Ala Ser Arg Asp 
50 55 60 

Pro Val Ala Arg Thr Ser Pro Leu Gin Thr Pro Ala Ala Pro Gly Ala 
65 70 75 

Ala Ala Gly Pro Ala Leu Ser Pro Val Pro Pro Val Val His Leu Ala 

85 90 

Leu Arg Gin Ala Gly Asp Asp Phe Ser Arg Arg Tyr Arg Gly Asp Phe 

100 1° 5 110 

Ala Glu Met Ser Ser Gin Leu His Leu Thr Pro Phe Thr Ala Arg Gly 
115 120 125 

Arg Phe Ala Thr Val Val Glu Glu Leu Phe Arg Asp Gly Val Asn Trp 
130 135 ^ 140 



Gly Arg lie Val Ala 
145 

Ser Val Asn Arg Glu 

165 

Met Thr Glu Tyr Leu 

180 

Gly Gly Trp Asp Ala 
195 

Leu Phe Asp Phe Ser 
210 

Leu Val Gly Ala Cys 
225 



Phe Phe Glu Phe Gly 
150 

Met Ser Pro Leu Val 

170 

Asn Arg His Leu His 

185 

Phe Val Glu Leu Tyr 
200 

Trp Leu Ser Leu Lys 
215 

lie Thr Leu Gly Ala 
230 



Gly Val Met Cys Val Glu 
155 160 

Asp Asn lie Ala Leu Trp 

175 

Thr Trp lie Gin Asp Asn 

190 

Gly Pro Ser Met Arg Pro 
205 

Thr Leu Leu Ser Leu Ala 
220 

Tyr Leu Ser His Lys 
235 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 
^ (A) LENGTH: 615 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1.-615 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 



ATG GCG CAC GCT GGG AGA ACG GGG TAC GAC AAC CGG GAG ATA GTG ATG 

let Ala His Ala Gly Arg Thr Gly Tyr Asp Asn Arg Glu He Val Met 

-1 5 10 15 

AAG TAC ATC CAT TAT AAG CTG TCG CAG AGG GGC TAC GAG TGG GAT GCG 

Lys Tyr He His Tyr Lys Leu Ser Gin Arg Gly Tyr Glu Trp Asp Ala 

20 25 30 

GGA GAT GTG GGC GCC GCG CCC CCG GGG GCC GCC CCC GCA CCG GGC ATC 

Gly Asp Val Gly Ala Ala Pro Pro Gly Ala Ala Pro Ala Pro Gly He 

35 40 45 

TTC TCC TCC CAG CCC GGG CAC ACG CCC CAT CCA GCC GCA TCC CGC GAC 

Phe Ser Ser Gin Pro Gly His Thr Pro His Pro Ala Ala Ser Arg Asp 

50 55 60 

CCG GTC GCC AGG ACC TCG CCG CTG CAG ACC CCG GCT GCC CCC GGC GCC 

Pro Val Ala Arg Thr Ser Pro Leu Gin Thr Pro Ala Ala Pro Gly Ala 

65 70 75 



48 



96 



144 



192 



240 



-14- 



GCC GCG GGG CCT GCG CTC AGC CCG GTG CCA CCT GTG GTC CAC CTG GCC 
Ala Ala Gly Pro Ala Leu Ser Pro Val Pro Pro Val Val His Leu Ala 



85 



90 



95 



CTC CGC CAA GCC GGC GAC GAC TTC TCC CGC CGC TAC CGC GGC GAC TTC 
Leu Arg Gin Ala Gly Asp Asp Phe Ser Arg Arg Tyr Arg Gly Asp Phe 



100 



105 



110 



288 



336 



GCC GAG ATG TCC AGC CAG CTG CAC CTG ACG CCC TTC ACC GCG CGG GGA 
Ala Glu Met Ser Ser Gin Leu His Leu Thr Pro Phe Thr Ala Arg Gly 



115 



120 



125 



CGC TTT GCC ACG GTG GTG GAG GAG CTC TTC AGG GAC GGG GTG AAC TGG 

Arg Phe Ala Thr Val Val Glu Glu Leu Phe Arg Asp Gly Val Asn Trp 



130 



135 



140 



GGG AGG ATT GTG GCC TTC TTT GAG TTC GGT GGG GTC ATG TGT GTG GAG 
Gly Arg lie Val Ala Phe Phe Glu Phe Gly Gly Val Met Cys Val Glu 



145 



150 



155 



160 



AGC GTC AAC CGG GAG ATG TCG CCC CTG GTG GAC AAC ATC GCC CTG TGG 
Ser Val Asn Arg Glu Met Ser Pro Leu Val Asp Asn lie Ala Leu Trp 



165 



170 



175 



IaTG ACT GAG TAC CTG AAC CGG CAC CTG CAC ACC TGG ATC CAG GAT AAC 
iMet Thr Glu Tyr Leu Asn Arg His Leu His Thr Trp lie Gin Asp Asn 



180 



185 



190 



t GGA GGC TGG GTA GGT GCA TCT GGT GAT GTG AGT CTG GGC 
h;Gly Gly Trp Val Gly Ala Ser Gly Asp Val Ser Leu Gly 

J"! 19 5 2 00 



205 



384 



432 



480 



528 



576 



615 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 
\ (A) LENGTH: 2 05 amino acids 

[ (B) TYPE: amino acid 

' (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Met Ala His Ala Gly Arg Thr Gly Tyr Asp Asn Arg Glu lie Val Met 
1 5 "10 15 

Lys Tyr lie His Tyr Lys Leu Ser Gin Arg Gly Tyr Glu Trp Asp Ala 

20 25 30 

Gly Asp Val Gly Ala Ala Pro Pro Gly Ala Ala Pro Ala Pro Gly lie 

35 40 45 

Phe Ser Ser Gin Pro Gly His Thr Pro His Pro Ala Ala Ser Arg Asp 
50 55 60 

Pro Val Ala Arg Thr Ser Pro Leu Gin Thr Pro Ala Ala Pro Gly Ala 



65 



70 



75 



80 



-15- 




# 



Ala Ala Gly Pro Ala Leu Ser Pro Val Pro Pro Val Val His Leu Ala 

85 90 95 

Leu Arg Gin Ala Gly Asp Asp Phe Ser Arg Arg Tyr Arg Gly Asp Phe 

100 105 HO 

Ala Glu Met Ser Ser Gin Leu His Leu Thr Pro Phe Thr Ala Arg Gly 
H5 120 125 

Arg Phe Ala Thr Val Val Glu Glu Leu Phe Arg Asp Gly Val Asn Trp 
130 135 I 40 

Gly Arg He Val Ala Phe Phe Glu Phe Gly Gly Val Met Cys Val Glu 

145 150 155 160 

Ser Val Asn Arg Glu Met Ser Pro Leu Val Asp Asn He Ala Leu Trp 

- - - 170 175 



165 



Met Thr Glu Tyr Leu Asn Arg His Leu His Thr Trp He Gin Asp Asn 

180 185 190 

Glv Gly Trp Val Gly Ala Ser Gly Asp Val Ser Leu Gly 
_ 1 195 200 205 

1(b) INFORMATION FOR SEQ ID NO: 24: 

^ (i) SEQUENCE CHARACTERISTICS: 
^ (A) LENGTH: 18 base pairs 

I1J (B) TYPE: nucleic acid 

yl (C) STRANDEDNESS: single 

s (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid; 
(A) DESCRIPTION: Synthetic DNA 

(iv) ANTI-SENSE: YES 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied_base 

(B) LOCATION: 16. -17 . , -, j „ v ^r-^ 
(D) OTHER INFORMATION: Last two mternucleoside linkages are 

phosphorothioates 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

18 

TCTCCCAGCG TGCGCCAT 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: Other nucleic acid; 
(A) DESCRIPTION: Synthetic DNA 



(iv) ANTI-SENSE: NO 



(ix) FEATURE: , 

(A) NAME /KEY : Modif ied_base 

(D) OTHER 1 INFORMATION: Last two internucleoside linkages are 
phosphorothioates 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

18 

TGCACTCACG CTCGGCCT 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

r 

' 3 (ii) MOLECULE TYPE: Other nucleic acid; 

;;\ (A) DESCRIPTION: Synthetic DNA 

E (iv) ANTI-SENSE: YES 

1J (ix) FEATURE: 

jl (A) NAME/KEY: Modif ied_base 

I, \l] otSeR^NFO^ION: Last two internucleoside linkages are 

vh phosphorothioates 

if (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

% 2 0 

*GCGCGGCGGG CGGGCGGGCA 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid; 

(A) DESCRIPTION: Synthetic DNA 

(iv) ANTI-SENSE: NO 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied__base 

(B) LOCATION: 18 . . 19 ^^^aa^ iinlfaopq are 
(D) OTHER INFORMATION: Last two internucleoside linkages are 

phosphorothioates 

-17- 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
GGGCGGAGGC CGGCCGGCGG 20 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Other nucleic acid; 
(A) DESCRIPTION: Synthetic DNA 

| (iv) ANTI-SENSE: YES 

> (ix) FEATURE: 

(A) NAME /KEY : Modif ied_base 

(B) LOCATION: 18 • . 19 

(D) OTHER INFORMATION: Last two internucleoside linkages are 
phosphor othioates 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

lAGCGGCGGCG GCGGCAGCGC 

t 

: t 

^2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 
s:i (A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
M (D) TOPOLOGY: linear 

"l (ii) MOLECULE TYPE: Other nucleic acid; 

(A) DESCRIPTION: Synthetic DNA 

(iv) ANTI-SENSE: NO 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied — base 

(B) LOCATION: 18 . . 19 . . 

(D) OTHER INFORMATION: Last two internucleoside linkages a 

phosphorothioates 



20 



re 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

20 

GGGCCGGGAA GGGCGCCCGC 



-18- 



